Improving speaker de-identification with functional data analysis of f0 trajectories
نویسندگان
چکیده
Due to a constantly increasing amount of speech data that is stored in different types databases, voice privacy has become major concern. To respond such concern, researchers have developed various methods for speaker de-identification. The state-of-the-art solutions utilize deep learning which can be effective but might unavailable or impractical apply for, example, under-resourced languages. Formant modification simpler, yet method de-identification requires no training data. Still, remaining intonational patterns formant-anonymized may contain speaker-dependent cues. This study introduces novel method, which, addition simple formant shifts, manipulates f0 trajectories based on functional analysis. proposed will conceal plausibly identifying pitch characteristics phonetically controllable manner and improve formant-based up 25%.
منابع مشابه
Functional data analysis of juggling trajectories
Scope The Dutch/Flemish Classification Society, VOC, aims at communicating scientific principles, methods, and applications of ordination and classification. The VOC is a member of the International Federation of Classification Societies (IFCS). Profile Analysis: A complementary way of reporting results in large assessments 14.40 Lieke Voncken Continuous norming of psychological tests: A compar...
متن کاملIdentification of mild cognitive impairment disease using brain functional connectivity and graph analysis in fMRI data
Background: Early diagnosis of patients in the early stages of Alzheimer's, known as mild cognitive impairment, is of great importance in the treatment of this disease. If a patient can be diagnosed at this stage, it is possible to treat or delay Alzheimer's disease. Resting-state functional magnetic resonance imaging (fMRI) is very common in the process of diagnosing Alzheimer's disease. In th...
متن کاملJoint analysis of f0 and speech rate with Functional Data Analysis
In this work we propose the use of Functional Data Analysis (FDA) as a powerful methodology to tackle problems where multiple continuous speech parameters have to be analyzed jointly. A production study on contrastive focus placement in Neapolitan Italian is used as illustration. Two features are analyzed, viz. f0 and relative speech rate, both expressed as continuous functions of time. The res...
متن کاملLimited data speaker identification
In this paper, the task of identifying the speaker using limited training and testing data is addressed. Speaker identification system is viewed as four stages namely, analysis, feature extraction, modelling and testing. The speaker identification performance depends on the techniques employed in these stages. As demonstrated by different experiments, in case of limited training and testing dat...
متن کاملImproving speaker segmentation via speaker identification and text segmentation
Speaker segmentation is an essential part of a speaker diarization system. Common segmentation systems usually miss speaker change points when speakers switch fast. These errors seriously confuse the following speaker clustering step and result in high overall speaker diarization error rates. In this paper two methods are proposed to deal with this problem: The first approach uses speaker ident...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Speech Communication
سال: 2022
ISSN: ['1872-7182', '0167-6393']
DOI: https://doi.org/10.1016/j.specom.2022.03.010